PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CA01g19070
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HD-ZIP
Protein Properties Length: 836aa    MW: 91074.1 Da    PI: 6.1456
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CA01g19070genomePEPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.66.9e-21132187156
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 +++ +++t++q++eLe+lF+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  CA01g19070 132 KKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 187
                 688999***********************************************999 PP

2START202.61.7e-633395631206
                 HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEEC CS
       START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevis 86 
                 ela++a++elvk+a+ +ep+W++s     e +n++e++++f++  +     + +ea+r++g+v+ ++  lve+l+d++ +W e+++    + +t++vis
  CA01g19070 339 ELALAAMDELVKMAQTDEPLWFRSMegarEVLNQEEYMRTFTPCIGmrpntFVSEASRETGMVIINSLALVETLMDSN-KWAEMFPcliaRTSTTDVIS 436
                 5899*********************9****************9999********************************.******************** PP

                 TT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXH CS
       START  87 sg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlph 178
                 sg      galqlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvS+d  ++ +  + +  +++lpSg+++++++ng+skvtwveh+++++   h
  CA01g19070 437 SGmggtrnGALQLMHAELQVLSPLVPiREVNFLRFCKQHAEGVWAVVDVSIDTIRETSAAPTFQSCRRLPSGCVVQDMPNGYSKVTWVEHAEYDEGANH 535
                 ********************************************************9999*************************************** PP

                 HHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
       START 179 wllrslvksglaegaktwvatlqrqcek 206
                  l+r+l++ g+ +ga++wvatlqrqce+
  CA01g19070 536 HLYRQLISAGMGFGAQRWVATLQRQCEC 563
                 **************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.05E-20119189IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.8E-21119189IPR009057Homeodomain-like
PROSITE profilePS5007117.248129189IPR001356Homeobox domain
SMARTSM003892.2E-17130193IPR001356Homeobox domain
CDDcd000861.92E-18131189No hitNo description
PfamPF000462.0E-18132187IPR001356Homeobox domain
PROSITE patternPS000270164187IPR017970Homeobox, conserved site
PROSITE profilePS5084843.544330566IPR002913START domain
SuperFamilySSF559612.47E-31332563No hitNo description
CDDcd088757.02E-124334562No hitNo description
SMARTSM002347.9E-47339563IPR002913START domain
PfamPF018523.5E-55339563IPR002913START domain
SuperFamilySSF559619.07E-24582762No hitNo description
SuperFamilySSF559619.07E-24789820No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009827Biological Processplant-type cell wall modification
GO:0042335Biological Processcuticle development
GO:0043481Biological Processanthocyanin accumulation in tissues in response to UV light
GO:0048765Biological Processroot hair cell differentiation
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 836 aa     Download sequence    Send to blast
MNFGGFLDNN SGGGGARIVA DTPFNNNSSS GNNKSSNNNN NDNNMPTGAI SQPRLLPQSL  60
AKTMFNSPGL SLALQTGMEG QNDVTRMGEA YEGNNSVGRR SREEEPDSRS GSDNLEGASG  120
DEQDAADKPP RKKRYHRHTP QQIQELESLF KECPHPDEKQ RLELSKRLCL ETRQVKFWFQ  180
NRRTQMKTQL ERHENSILRQ ENDKLRAENM SIREAMRNPI CTNCGGPAMI GEISLEEQHL  240
RIENARLKDE LDRVCALAGK FLGRPISSLV TSMPPPMPNS SLELGVGSNG FGGLSNVPTS  300
LPLAPPDFGV GISASMPVVP STRQTSGIER SLERSMYLEL ALAAMDELVK MAQTDEPLWF  360
RSMEGAREVL NQEEYMRTFT PCIGMRPNTF VSEASRETGM VIINSLALVE TLMDSNKWAE  420
MFPCLIARTS TTDVISSGMG GTRNGALQLM HAELQVLSPL VPIREVNFLR FCKQHAEGVW  480
AVVDVSIDTI RETSAAPTFQ SCRRLPSGCV VQDMPNGYSK VTWVEHAEYD EGANHHLYRQ  540
LISAGMGFGA QRWVATLQRQ CECLAILMSS TVTSRDYTAI TPSGRRSMLK LAQRMTNNFC  600
AGVCASTVHK WNKLCAGNVD EDVRVMTRKS VDDPGEPPGI VLSAATSVWL PVSPQRLFDF  660
LRDERLRSEW DILSNGGPMQ EMAHIAKGQD HGNCVSLLRA SAMNANQSSM LILQETCIDA  720
AGALVVYAPV DIPAMHVVMN GGDSAYVALL PSGFSIVPDG PGSRGSSTGN CGSNGPPSCN  780
GGPDQRISGS LLTVAFQILV NSLPTAKLTV ESVETVNNLI SCTVQKIKGA LHCES*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00421DAPTransfer from AT4G00730Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankGQ2221850.0GQ222185.1 Solanum lycopersicum cultivar M82 cutin deficient 2 (CD2) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016575742.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLM1CNN30.0M1CNN3_SOLTU; Uncharacterized protein
STRINGPGSC0003DMT4000713770.0(Solanum tuberosum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA16262465
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein